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Abstract: A severe atypical pneumonia designated as severe acute respiratory syndrome (SARS) by The World Health 
Organization broke out in China and menaced to more than other 30 countries between the end of the year 2002 and June 
of the year 2003. A novel coronavirus called severe acute respiratory syndrome coronavirus (SARS-CoV) has been re- 
cently identified as the etiological agent responsible for the infectious SARS disease. Based on extensively scientific co- 
operation and almost two-year’s studies, remarkable achievements have been made in the understanding of the phyloge- 
netic property and the genome organization of SARS-CoV, as well as the detailed characters of the major proteins in- 
volved in SARS-CoV life cycle. In this review, we would like to summarize the substantial scientific progress that has 
been made towards the structural and functional aspects of SARS-CoV associated key proteins. The progress focused on 
the corresponding key proteins’ structure-based drug and vaccine developments has been also highlighted. The concerted 
and cooperative response for the treatment of the SARS disease has been proved to be a triumph of global public health 
and provides a new paradigm for the detection and control of future emerging infectious disease threats. 
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INTRODUCTION 


From the end of the year 2002 to the June of the year 
2003, one severe epidemic disease called severe acute respi- 
ratory syndrome (SARS) broke out in China and quickly 
spread to more than 30 other countries (http://www.who.int/ 
csr/sars/country/en/). In fact, the SARS disease ever severely 
menaced the worldwide population and totally more than 
8400 patients and 789 deaths were reported by the World 
Health Organization (http://www.who.int/csr/sars/country/ 
en/). This emergent status urged a tightly worldwide collabo- 
ration in dealing with such a disease. With many efforts in 
the extensively scientific cooperation, one previously un- 
known coronavirus designated as SARS coronavirus (SARS- 
CoV) was determined as the etiological agent of this severe 
infection and believed to be a novel human coronavirus 
possibly originated from non-human host [1]. 


The coronaviruses are a diverse group of enveloped, 
positive-stranded RNA viruses that cause respiratory and 
enteric diseases in humans and other animals [2]. Genome 
sequencing analysis has revealed that SARS-CoV involve 
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the largest viral RNA genome known to date, encompassing 
29,727 nucleotides predicted to contain 14 functional open 
reading frames (ORFs) [1]. The two large 5'-terminal ORFs, 
la and 1b, constitute the replicase gene encoding the pro- 
teins (PL2", 3CL?” and a series of non-structural proteins) 
required for viral RNA transcription and replication and the 
remaining twelve ORFs encode the four key structural pro- 
teins, the spike (S) protein, the nucleocapsid (N) protein, the 
membrane (M) protein and the small envelope (E) protein, 
and other eight accessory proteins that are not likely to be 
essential in tissue culture but may provide a selective ad- 
vantage in the infected host [3]. The phylogenetic analyses 
of the major structural proteins indicated that SARS-CoV 
does not closely resemble any of the previously known three 
groups of coronaviruses [4]. Accordingly, the elucidation of 
the structural and functional characters of the major proteins 
encoded by SARS-CoV genome has become an alluring 
project, and the corresponding studies have been extensively 
reported. With the resolved crystal or homology modeling 
structures and corresponding biological function determina- 
tions of these major proteins, a series of structure-based drug 
and vaccine developments have been carried out by using 
the virtual screening and the in vitro putative inhibitors 
screening platforms. In this review, we would like to sum- 
marize the current understandings of the structural and func- 
tional aspects of some major SARS-CoV proteins. The cor- 
responding drug and vaccine development for the treatment 
of the SARS disease has been also presented in this review. 


© 2006 Bentham Science Publishers Ltd. 
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MAJOR PROTEINS ENCODED BY SARS-COV 
GENOME AND CORRESPONDING DRUG DEVEL- 
OPMENT 


3C-Like Protease (3CL""’) 


SARS-CoV genome has been predicted to contain 14 
functional open reading frames (ORFs) for encoding the 
proteins required for virus replication and transcription [5]. 
Two large 5’-terminal ORFs, la and 1b, constitute the repli- 
case gene that encodes two overlapping polyproteins, ppla 
(around 450kDa) and pplab (around 750kDa) necessary for 
viral RNA synthesis. Proteolytic processing of the non- 
structural polyproteins is a vital step in the replication cycle 
of coronavirus and such processing is commonly performed 
by virus-genome encoded proteases [6,7]. Polyproteins ppla 
and pplab are cleaved extensively by a papain-like cysteine 
protease (PL2’"°) and another chymotrypsin-like protease to 
yield a multi-subunits protein complex called “viral repli- 
case-transcriptase” [5,8,9]. The chymotrypsin-like protease 
was named as 3C-like protease (3CL’®) due to its distant 
relationship with the 3C proteases of picornaviruses [8]. 
Since 3CL”® functions as a pivotal protease in coronavirus 
polyproteins processing and controls the activities of coron- 
avirus replication complexes, it is also called main protease 
(M?"°). Due to its functional indispensability in coronavirus 
life cycle, SARS-CoV 3CL?” has become an attractive target 
in discovering new anti-SARS agents [9]. 


The molecular cloning, expression, purification, and 
mass spectrometric characterization of SARS-CoV 3CL?® 
have been reported [10], and sequence alignment indicated 
that SARS-CoV 3CL?®° shares high homology with the 
3CL?s of other coronaviruses such as transmissible gastro- 
enteritis coronavirus (TGEV) and human coronavirus 
(HCoV) 229E [9,11]. The X-ray crystal structures of the 
3CL’"s of TGEV, HCoV 229E and SARS-CoV revealed 
that the 3D structures of 3CL proteases are more conserved 
compared with their sequences [9,12]. Structurally (Fig. 1), 
SARS-CoV 3CL?® contains three domains: domains I and II 
(residues 8-101 and residues102-184) have six-stranded 
antiparallel B-barrel forming a chymotrypsin fold, which 
resemble the architecture of chymotrypsin and picornavi- 
ruses 3C proteases. The substrate-binding site is located in a 
cleft between these two domains, and residues 201-303 form 
an antiparallel globular cluster of five a-helices (domain IIT) 
connecting to domain II by a long loop region (residues 185- 
200). The 16-residue loop region has been implicated to play 
a role in substrate-binding as indicated from the reported 
biochemical data and the crystal structure [9,12-14]. The 
crystal structure analysis shows that SARS-CoV 3CL?° 
forms a dimer with the two molecules oriented perpendicular 
to one another (Fig. 1), similar to the TGEV and HCoV 
3CL”” structures [9,12,13]. The contact interface predomi- 
nantly involves interactions between domain II of one 
monomer and the N-terminal residues of the other monomer. 
The N-terminus of one monomer (residues 1-7) is squeezed 
into between domains II, III of the parent molecule and 
domain II of the other monomer resulting in a number of 
highly specific interactions within the dimeric structure. The 
influence of N-terminal residues 1-7 deletion on the dimeri- 
zation features of SARS-CoV 3CL?” has been quantitatively 
evaluated by using biochemical and biophysical techniques 
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[15]. The results revealed that the N-terminal deleted prote- 
ase still remains a dimer/monomer mixture within a wide 
range of protein concentrations, which is similar to the full- 
length protease. The equilibrium dissociation constant (Kiss) 
of the N-terminal deleted protease dimer (262 1M) is very 
similar to that of the full-length protease dimer (227 UM) 
(Table 1). However, the N-terminal deletion results in almost 
complete loss of enzymatic activity of the protease. Com- 
plementary molecular dynamics (MD) and docking simula- 
tions demonstrated that the N-terminal deleted protease 
dimer adopts a state different from that of the full-length 
protease dimer, which increases the angle between the two 
monomers and reduces the binding pocket that is not benefi- 
cial to the substrate binding (Fig. 2). These results thereby 
suggested that the N-terminus is not indispensable for the 
protease dimerization, but may fix the dimer in its active 
state that is vital to the enzymatic activity [15]. Furthermore, 
domain III has been predicted to be potent in substrate rec- 
ognition and responsible for positioning the N-terminus of 
one monomer to interact with the active site of the other 
monomer. The critical role of domain III in dimerization and 
enzymatic activity of SARS 3CL?° has been also demon- 
strated [14]. As the crystal structures of 3CL’”® in different 
coronaviruses give the similar dimeric structures and nearly 
all the side chains of 3CL’”° involved in the formation of the 
dimer are conserved [9,12], it is believed that the dimer 
might be the biological functional form of 3CL° and the 
dimerization interfacial region has been suggested to be a 
possible target for rational drug design against SARS-CoV 
[14,16,17]. 


Fig. (1). The solved crystal structure of SARS-CoV 3CL?” 
dimer. 


More detailed structural characters of SARS-CoV 3CL?” can be 
seen from reference [12]. 


The SARS-CoV replicase gene is encoded by two large 
5’-proximal ORFs, la and 1b. Polyproteins ORFla and 
ORF 1b are connected by a ribosomal frameshift site, result- 
ing in the translation of ppla and a carboxyl-extended pro- 
tein pplab. As a non-structural protein, SARS-CoV 3CL*”® 
cleaves ppla and pplab precursors at 11 sites to release a 
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Table 1. 


SARS-CoV 3CL”” 


Full-length 


AB? aim’ 
(kcal/mol) 
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Thermodynamic Parameters for SARS-CoV 3CL”"’’s Dimerization at 25°C and pH 7.5 


AS° aim” 
(cal/mol*eK) 


AG° aim? 
(kcal/mol) 


-11.124+0.163 


N-terminal deleted 


-6.758+0.729 


“From non-linear regression of calorimetric dilution data, fitted by a simple dimer-monomer dissociation model; *Calculated from the general thermodynamic equation, AG? jim = RT 


In Kiss = AH? gim—TAS°gim. Data obtained from Chen et al. [15] 


series of non-structural proteins, including RNA-dependant 
RNA polymerase (RdRp), ATPase/helicase and other func- 
tion-unidentified non-structural proteins (nsp’s) [3,18]. 
These proteins are responsible for replication of the virus 


Fig. (2). The dimerization state change of the N-terminal de- 
leted SARS-CoV 3CL?” dimer [redrawn from ref. 15]. 


Superposition of the N-terminal deleted SARS-CoV 3CL"® dimer 
(monomers A and B are represented as black, respectively) with the 
full-length protease dimer (light gray). To clearly demonstrate the 
dimerization state change, the two monomer Bs of the full-length 
and N-terminal deleted proteases were superposed each other with 
the smallest root mean-square deviation (RMSD). The arc arrows 
represent the rotation direction of monomer A of the N-terminal 
deleted protease dimer. 


genome, thereby producing nested transcripts that are indis- 
pensable for the synthesis of viral proteins. Similar to TGEV 
and HCoV 229E 3CL proteases, SARS-CoV 3CL® employs 
Cys-145 and His-41 as the catalytic dyad in the catalytic site 
but lacks the corresponding third catalytic site [8,16,19,20], 
which is an acidic residue in chymotrypsin. The substrate 
specificity of SARS-CoV 3CL?” has been well defined and 
the cleavage site in polyproteins [(S, V, T, P)-X- (L, I, F, V, 
M)-QL (S, A, G, N, C)] are highly conserved [18]. The P1 
position of the substrate is exclusively occupied by gluta- 
mine, the bulky hydrophobic residues (mainly leucine/iso- 
leucine) are dominant at the P2 position and Ser, Val, Thr, 
Pro are clearly favored at the P4 position. At the P1’ posi- 


tion, small aliphatic residues (Ser, Ala, Gly, Asn, Cys) are 
found, of which the Ser content is more than 50%. There are 
no highly favored residues at the P3, P2’, P3’, P4’ positions. 
In addition, the secondary structural studies for the putative 
substrates of SARS-CoV 3CL"”® have revealed that the sub- 
strates with more B-sheet like structures tend to be cleaved 
fast [16]. The in vitro trans-cleavage activity of SARS-CoV 
3CL™ was extensively determined by the substrate-analog 
peptide cleavage assay using the conventional RP-HPLC and 
fluorescence-based methods [16,20-22], e.g. the enzymatic 
activity of the protease at different pH values and tempera- 
tures was characterized in detail by fluorescence resonance 
energy transfer (FRET) techniques [22]. The results showed 
the substantial pH and temperature-triggered activity switch 
of SARS-CoV 3CL?, and the systematic site-directed 
mutagenesis analyses revealed that substitutions of His", 
Cys'*, and His!® resulted in complete loss of enzymatic 
activity, while replacement of Met!®? with Ala exhibited 
strongly the increase of the enzymatic activity. This work 
provided valuable information in understanding the catalytic 
mechanism for SARS-CoV 3CL?®, and this FRET-based 
assay might supply an ideal approach for SARS-CoV 3CL?” 
putative inhibitors exploration. Furthermore, the crystal 
structure of SARS-CoV 3CL?® has revealed considerable 
pH-dependant conformational changes, which correlates 
well with the varying trans-activities of 3CL"® at different 
pH values [20, 22]. It is suggested that the reported low in 
vitro activity of SARS-CoV 3CL?” in micromolar level is 
perhaps due to the low amount of the active dimeric form 
under the assay conditions, following the prediction that 
only the dimer is the active form of the protease. Recently, 
the characterization of in vivo cis-cleavage activity of the 
protease has been also presented by the cell-based cis- 
cleavage assay [23]. It is not clear whether SARS-CoV 
3CL™ cleaves itself in cis or trans from the replicase poly- 
proteins precursor; however, once released, dimerization of 
SARS-CoV 3CL?” is necessary for its trans-cleavage activ- 
ity. The extensive interactions of one monomer’s N-terminus 
with domain II of the other monomer seem to be the prod- 
ucts of autoprocessing of precursor monomeric SARS-CoV 
3CL by a tailor-made mechanism along with formation of 
the dimer, which keeps the protease in an conformation suit- 
able for the active site to subsequently act on other cleavage 
sites in the polyproteins ppla and pplab during viral repli- 
cation and assembly. 


SARS-CoV 3CL”” has supplied an attractive and key 
target for anti-SARS reagent discovery. According to the 
3CL” structural models from the solved crystal structures of 
TGEV, HCoV and SARS-CoV, a number of potential in- 
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hibitors of SARS-CoV 3CL?® have been proposed from the 
collections of synthetic compounds, natural products and 
approval drugs by using virtual screening [11,24-26]. In 
vitro fluorescence-based and in vivo cell-based assays have 
been developed for putative SARS-CoV 3CL?” inhibitors 
screening [27-30]. To date, the potential inhibitors of SARS- 
CoV 3CL?” identified from these studies include the HIV-1 
protease inhibitors L-700 and 417 [24], the reverse tran- 
scriptase inhibitors calanolide A and nevirapine [31], the a- 
glucosidase inhibitor glycovir [31], the general antiviral 
ribavirin [31], an natural product sabadinine [25], KZ7088, a 
derivative of AG7088 (a drug to treat the common cold) 
[32], bifunctional aryl boronic acid compounds [28], the 
mercury-containing compounds thimerosal, phenylmercuric 
acetate and hexachlorophene, as well as zinc-conjugated 
compounds [33]. In addition, cinanserin, a well-character- 
ized serotonin antagonist that has undergone preliminary 
clinical testing in humans in the 1960s, was also recently 
discovered by L. Chen, ef al., as one potent inhibitor of 
SARS-CoV 3CL?® [34]. The extensively experimental eval- 
uations demonstrated that both cinanserin and its hydrochlo- 
ride could inhibit the bacterially expressed SARS-CoV 
3CL’ and HCoV 229E 3CL?” by ICs around 5uM (Fig. 3). 
Antiviral activities for cinanserin and its hydrochloride were 
also evaluated in tissue culture assays, and the results re- 
vealed their strong inhibitions against coronavirus replica- 
tion at nontoxic drug concentrations (Fig. 4). This study has 
provided a new strategy in identifying new pharmacological 
activities for the known drugs [34]. 


Spike (S) protein 


Coronavirus Spike (S) protein is a type I membrane 
glycoprotein that has an N-terminal ectodomain, a C-ter- 
minal hydrophobic anchor and an unusual cysteine-rich 
domain bridging the putative junction of the anchor and the 
cytoplasmic tail [8]. On the membrane-bound polysomes, S 
protein is translated and inserted into rough endoplasmic 
reticulum (RER), cotranslationally glycosylated, and trans- 
ported to the Golgi complex. During the transport, S proteins 
are incorporated onto the maturing virus particles, which 
assembly and bud into a compartment that lies between the 
RER and Golgi. Virions are carried from Golgi to plasma 
membrane in the secretory vesicles, and are released from 
cells when virion-containing vesicles fuse with plasma 
membrane. The excess S proteins that are not incorporated 
onto virus particles are transported to the surface of plasma 
membrane. S protein can interact with a cellular receptor and 
mediate membrane fusion to allow viral entry into the sus- 
ceptible target cells [35]. 


SARS-CoV S protein is 1255 amino acids long glyco- 
protein. It is predicted to possess a 13 amino acid signal 
peptide at the N-terminus, a single ectodomain (1182 amino 
acids) and a transmembrane region followed by a short 
cytoplasmic tail (28 residues) at the C-terminus [4] (Fig. 
5A). SARS-CoV S protein is translated as a large polypep- 
tide, which is subsequently cleaved by virus-encoded or 
host-encoded proteases to produce two functional subunits, 
S1 and S2 (Fig. 5A). S1 is known to be the peripheral frag- 
ment and S2 is the membrane-spanning fragment. Both the 
Sl and S2 subunits appear to cause cell fusion when ex- 
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pressed individually, this suggests that they are biologically 
fully active [8]. 
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Fig. (3). Inhibitory activity of cinanserin and its hydrochloride 
on the proteolytic activity of SARS-CoV (A and B) and HCV 
229E 3CL?™ (C and D). 


Inhibition of cleavage was measured by FRET using a peptide 
substrate labeled with a pair of fluorogenic dyes. (A, C cinanserin; 
B, D cinanserin hydrochloride). Data obtained from Chen et al. 
[34]. 


The S proteins of coronaviruses including SARS-CoV, 
associate with cellular receptors to mediate infection against 
their target cells. Recently, Li et al. [36] used a straightfor- 
ward coimmunoprecipitation method to precipitate the virus 
attachment S protein with lysates from Vero E6 cells that are 
susceptible to virus infection, and the coimmunoprecipitated 
proteins were analyzed by mass spectrometric analysis. The 
results showed that a metallopeptidase isolated from the 
SARS-CoV infected Vero E6 cells, called angiotensin- 
converting enzyme 2 (ACE2), could efficiently bind to the 
S1 domain of SARS-CoV S protein, and the soluble ACE2 
form could block the association of $1 domain with Vero E6 
cells. These results thereby indicate that ACE2 is a func- 
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Fig. (4). Inhibition of SARS-CoV replication by cinanserin and cinanserin hydrochloride. 


(A) Reduction of SARS-CoV RNA concentration in supernatant. Vero cells were infected with SARS-CoV at an MOI of 0.01 and virus 
RNA concentration was measured by real-time PCR. Influence of the compounds on cell viability was measured by MTT test. Virus RNA 
measurement and MTT test were performed with cells and supernatant, respectively, of the same cell culture well. Virus RNA concentration 
of untreated cells (4 x 10’ RNA copies/ml) and the corresponding MTT value were defined as 1. Mean and range of duplicate testings are 


shown. 


(B) Reduction of the concentration of infectious particles in supernatant. Supernatant of infected cells treated with 50 ug/ml compounds and 
of cells that were left untreated were harvested 2 days post infectionem and the virus titer was determined by immunofocusassay. The cell 
culture wells of the immunofocusassay inoculated with dilutions of the supernatant are shown. Data obtained from Chen et al. [34]. 


tional receptor of SARS-CoV [36,37]. It is found that the 
smaller S protein fragment of residues 327-510 or 318-490 
did not bind to ACE2, and the 193-amino acid fragment 
(residues 318-510) could bind to ACE2 more efficiently than 
the full S1 domain (residues 12-672)[38]. Further binding 
inhibition assay suggested that the receptor-binding domain 
of SARS-CoV S protein is located between amino acid 
residues 303 and 537 [39]. Direct binding mapping experi- 
ment revealed that amino acids 270 to 510 of SARS-CoV S 
protein are required for the interaction of S protein with the 


receptor of the target cell [40]. The point mutation at Asp454 
might abolish the association of the full $1 domain and the 
193-residue fragment with ACE2, this demonstrates that 
Asp454 is critical for S protein interaction with ACE2 [38]. 
The fact that ACE2 acts as a partner of SARS-CoV S protein 
in mediating virus entry and cell fusion suggested that S 
protein might work as a major antigenic determinant that 
induces neutralizing antibody. The recombinant S protein 
could exhibit the antigenicity and ACE2-binding ability, and 
it should become a good candidate for further SARS vaccine 
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development and anti-SARS therapy [41]. Sui et al. [42] 
found that 80R human monoclonal antibody may be a useful 
viral entry inhibitor for the emergency prophylaxis and 
treatment of SARS, and that the ACE2-binding site of S1 
domain could be an attractive target for subunit vaccine and 
drug development. The siRNAs could also effectively and 
specifically inhibit the gene expression of S protein in 
SARS-CoV-infected cells [43], which indicates that the 
interference of S protein expression could be a powerful tool 
for SARS-CoV inhibition. Yang ef al. [44] found in a mouse 
model that a DNA vaccine encoding SARS-CoV S protein 
may induce T cell and neutralizing antibody responses and 
protective immunity. In addition, the soluble ACE2 and 
various fusion constructs or fragments of ACE2 could also 
serve as potent inhibitors for SARS-CoV infection in vivo 
[45]. In considering the essential role of the immunological 
fragment of the S protein (Ala251-His641, SARS_S1b) in 
SARS-CoV entering the host cells, the thermally induced 
and GuHCl-induced unfolding features of SARS_S1b was 
quantitatively characterized by Yu et al. [46], which re- 
vealed that the secondary structure of SARS_S1b has a 
relatively high thermal stability. Moreover, the secondary 
and three-dimensional structural predictions by homology 
modeling indicated that SARS_S1b folds as a globular-like 
structure by B-sheets and loops and two of the totally four 
tryptophans were located on the protein surface [46]. The 
percentage of o-helix (3%) and B-sheet (35%) for SARS_ 
S1b obtained from the secondary structure prediction in this 
study is extremely close to the data reported by Spiga et al. 
[47], and the 3D model suggested that SARS_S1b is most 
likely to be an all-B-sheet globular protein, coinciding with 
the reported result that coronavirus S1 protein is the globular 
part of S protein [48]. Moreover, the three-dimensional 
model could be also used to explain the published results of 
S1/ACE-2 binding and immunizations, and afford a possible 
platform for further biological study and drug discovery 
targeting the S protein. 


Two regions within S2 domain of SARS-CoV, named 
heptad repeat 1 and 2 (HRI, residues 892-1013 and HR2, 
residues 1145-1194) regions, exhibit a high degree of se- 
quence conservation with other coronaviruses. HR1 and 
HR2 associate with each other into an antiparallel six-helix 
bundle revealed by the resolved crystal structure (Fig. 5B) 
[49], with structural features typical of the other known class 
I fusion proteins [50,51]. The boundaries of the HR1/HR2 
interaction are mapped to residues 896-972 in HRI and 
residues 1142-1188 in HR2, possibly extending a few more 
residues at the N terminus of HR1 (up to residue 889) and the 
C terminus of HR2 (up to residue 1193, which is the pre- 
dicted boundary of the S2 transmembrane domain); and 
these boundaries would position the fusion peptide in the 
region of residues 870-890 [51]. The HR1 domain of SARS- 
CoV S protein forms a continuous helical structure, which 
can be also fully formed in the absence of any interaction 
with the HR2 domain. This result is unlike HIV gp41, in 
which the N-peptides are unable to form a trimeric-coiled 
coil in the absence of the C-peptide [52], and is more similar 
to the influenza virus hemagglutinin [51]. After binding to 
the target cell, the transmembrane spike might change con- 
formation by association between the HR1 and HR2 regions 
to form an oligomeric structure, leading to the fusion be- 
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tween the viral and target-cell membranes [50]. In addition, 
the crystal structure of S2 domain of SARS-CoV S protein 
[49] revealed that the fusion core is a six-helix bundle in 
which three HR2 helices pack against the hydrophobic 
grooves on the surface of central coiled coil formed by three 
parallel HR1 helices in an oblique antiparallel manner (Fig. 
5B). This structure shares significant similarity with the 
fusion core structure of mouse hepatitis virus (MHV) S 
protein and other viral fusion proteins, which suggests a 
conserved mechanism of membrane fusion in coronaviruses. 
The drug discovery strategy aimed at inhibiting viral entry 
by blocking hairpin formation that has been successfully 
used in human immunodeficiency virus 1(HIV-1) inhibitor 
development might be applicable to SARS-CoV inhibitor 
exploration on the basis of the identified structural informa- 
tion [49]. 


Nucleocapsid (N) Protein 


The nucleocapsid (N) protein of SARS-CoV is a struc- 
tural protein that primarily functions in recognizing a stretch 
of RNA that serves as packaging signal and leads to the 
formation of the ribonucleoprotein (RNP). It wraps the ge- 
nomic RNA segment into a helical nucleocapsid that is 
further compacted into a core possibly with icosahedral 
symmetry. SARS-CoV N protein is a highly charged basic 
protein with 422 amino acids (range for other coronaviruses, 
377 to 454) including seven successive hydrophobic residues 
near the middle of the protein. It shares a very low homology 
with the N proteins of other coronaviruses [1]. 


The major biochemical and thermodynamic features of 
SARS-CoV N protein was quantitatively studied by using 
the relevant biophysical and biochemical techniques [53]. 
The results showed that the recombinant SARS-CoV N 
protein is easy to denature at low temperature and with low 
concentration of denaturant, which indicates that this struc- 
tural protein has low stability that might be critical for 
SARS-CoV function and stability [53,54]. SARS-CoV N 
protein tends to form oligomer in vitro, more probably dimer 
at low concentration, and it is suggested that the dimeric N 
protein might act as a basic functional unit in vivo. It has 
been known that N-N self-interactions may be necessary for 
subsequent formation of the nucleocapsid and assembly of 
the viral particles [55]. Noticeably, SARS-CoV N protein 
self-association may be important for initiation of RNP 
formation. As for the fragments involved in the formation of 
dimer or oligomer, however, there are two different opin- 
ions. He et al. [55] reported that a serine/arginine-rich (SR- 
rich) motif (SSRSSSRSRGNSR) between residues 184 and 
196 is crucial for SARS-CoV N protein oligomerization 
since deletion of this region could completely abolish SARS- 
CoV N protein self-multimerization. Nevertheless, Surjit et 
al. [56] demonstrated that the C-terminal 209 amino-acid 
region constitutes the interaction domain responsible for 
self-association of SARS-CoV N protein to form dimers. 
While a recent work [57] has provided an incongruent result, 
which demonstrates that the C-terminal domain of SARS- 
CoV N protein (residues 283-422) has multimeric ability, 
although the full-length protein tends to form dimers. Fur- 
ther study revealed that the multimeric ability of the C- 
terminal domain could by weakened by the SR-rich motif 
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Fig. (5). The predicted overall structure of Spike (S) protein of SARS-CoV [redrawn from ref. 46]. 


(A). Structurally SARS-CoV S protein is predicted to have a 13 amino acid signal peptide at the amino-terminus, a single ectodomain (1182 
amino acids) and a transmembrane region followed by a short cytoplasmic tail (28 residues) at the carboxyl-terminus. As being translated as 
a large polypeptide, the protein can be subsequently cleaved by virus-encoded or host-encoded proteases to produce two functional subunits, 


S1 and S2. 


(B). The crystal structure of SARS-CoV S protein fusion core [redrawn from ref. 49]. 


interaction with the central region (residues 211-290), and 
suggested that the SR-rich motif might play an important 
role in the transformation of SARS-CoV N protein between 
the dimer and multimer for self-association or dissociation. 
Therefore, more attention should be paid in order to eluci- 
date the oligomerization mechanism for SARS-CoV N pro- 
tein considering these published conflicting results. 


Recently, Luo ef al. [58] reported that SARS-CoV N 
protein binds tightly to human cyclophilin A (CypA). CypA 
has ever been shown to play an important role in HIV infec- 
tion. The Gag polyprotein of HIV-1 binds to most members 
of the cyclophilin family of peptidyl-peptide-prolyl isomer- 
ases [59]. However, of the 15 known human cyclophilins, 
only human CypA is integrated inside the viral core of HIV- 
1 by interacting specifically with the capsid domain (CA) of 
the Gag polyprotein [60], and human CypA subsequently 
performs an essential function in HIV-1 replication. Luo’s 
result [58] indicated that SARS-CoV N protein has a binding 
affinity to human CypA by the equilibrium dissociation 
constant (Kp) ranging from 6 to 160nM. The probable bind- 


ing sites of these two proteins were detected by modeling the 
three-dimensional structure of the SARS-CoV N/human 
CypA complex, from which the important interaction residue 
pairs between the proteins were deduced (Fig. 6). Mutagene- 
sis experiments further validated the binding model. Such 
presently observed SARS-CoV N/human CypA interaction 
model might provide a new hint for facilitating the under- 
standing of another possible SARS-CoV infection pathway 
against the human target cell. 


In addition, Luo et al. [61] also discovered another 
binding partner for SARS-CoV N protein. They found that 
SARS-CoV N protein exhibited high binding affinity against 
human heterogeneous nuclear ribonucleoprotein Al (hnRNP 
Al), which is related to the pre-mRNA splicing in the nu- 
cleus and translation regulation in the cytoplasm [62]. The 
results clearly demonstrated that SARS-CoV N protein could 
directly and specifically bind to human hnRNP A1 in vitro 
(Fig. 7), and further in vivo yeast two-hybrid assays (Fig. 8A 
and 8B) indicated that such binding relates to the fragment 
(residues 161-210) of SARS-CoV N and the Gly-rich do- 
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main (residues 203-320) of hnRNP Al [61]. It is suggested 
that both SARS-CoV N and hnRNP A1 proteins are possibly 


\ ee Wi! 


Fig. (6). Schematic representation of the 3D model of SARS- 
CoV N protein (residues 235-369)-human CypA complex [re- 
drawn from ref. 58] 


The binding loop Trp302-Pro310 of SARS-CoV N protein was 
highlighted in red color. Resides shown in ball-and-stick model are 
essential in the SARS-CoV N-human CypA binding, and site- 
directed mutagenesis analyses were performed on these residues to 
validate this binding model. See [ref. 58] for further details. 
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Fig. (7). SARS-CoV N protein/human hnRNP A1 interaction 
determined by GST pull-down Data obtained from Luo et al. 
[61]. 
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within the SARS-CoV replication/transcription complex 
and SARS-CoV N/human hnRNP AI interaction might 
function in the regulation of SARS-CoV RNA synthesis. 
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Fig. (8). Mapping the interaction domain of SARS-CoV N 
protein with human hnRNP A1. 


(A) Schematic description of the truncated fragments and (B) the 
yeast two-hybrid assay results for SARS-CoV N protein/human 
hnRNP AI interactions in their truncated and non-truncated forms. 
The empty vectors pGBKT7 and pGADT7 co-transformed were 
used as the negative control. Data obtained from Luo et al. [61]. 


A high affinity interaction has been reported to exist 
between the N protein and leader/intergenic RNA sequence 
in MHV[63]. The MHV N protein contains two RNA- 
binding domains that interact with the 3' RNA, one is located 
in the amino and the other in the carboxyl terminal regions. 
These two domains are part of the three conserved regions in 
the N proteins of coronaviruses. The smallest MHV N pro- 
tein fragment that retains RNA-binding activity is a 55 
amino acids segment containing residues 177-231, and 
AAUCYYA was identified to be the potential minimum 
ligand for MHV N protein [63]. Actually, SARS-CoV N 
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protein shares a high homology with MHV N protein re- 
garding this domain, thus SARS-CoV N protein is proposed 
to interact with RNA through the same region even though 
there is little literature focusing on it until now. The structure 
of N-terminal RNA-binding domain (residues 49-178) of 
SARS-CoV N protein has been determined recently by nu- 
clear magnetic resonance (NMR) spectroscopy [64]. The 
result showed that this domain consists of a five-stranded 
antiparallel beta-sheet with a folding topology distinct from 
other RNA-binding proteins (Fig. 9). Even though the over- 
all folding is very different from any known protein, the N- 
terminal domain of SARS-CoV N protein exhibits some 
similarities to the typical RNP-binding motif. The central 
strand of the beta-sheet, 83, contains the highly conserved 
sequence RWYFYYLGT, which is found in other reported 
coronaviruses N proteins [1,4]. The RNA-binding site of 
SARS-CoV N protein contains a high content of positively 
charged lysine and arginine residues on its surface, and the 
long flexible B hairpin with its positively charged surface 
may grasp RNA against the B sheet similar to the case in 
U1A RNP RNA-binding protein, where a highly positively 
charged loop between B2 and B3 and the face of the B-sheet 
are involved in RNA binding [64]. It is reported that the 
compound 6-amino-4-hydroxy-naphthalene-2-sulfonic acid, 
which might mimic the phosphate backbone of RNA, could 
interact with this arginines and/or lysines region of SARS- 
CoV N protein, thus serving as one possible lead compound 
that tightly binds to SARS-CoV N protein and subsequently 
inhibits its biological function [64]. 


Fig. (9). The N-terminal RNA-binding domain (residues 49-178) 
of SARS-CoV N protein determined by NMR spectroscopy 
[redrawn from ref. 64]. 


See [ref. 64] for further details about the structure of the N-terminal 
RNA-binding domain of SARS-CoV N protein. 


Membrane (M) Protein 


The N-terminal domains of coronavirus membrane (M) 
proteins are localized on the viral surface and the C-terminus 
is exposed to the interior face of the virion. Furthermore, the 
predicted M protein of SARS-CoV contains a hydrophobic 
transmembrane domain (residues 12 to 37), which is similar 
to the membrane (M) proteins of other coronaviruses. A 


Current Pharmaceutical Design, 2006, Vol. 12, No. 35 4547 


highly conserved amino acid sequence SMWSEFNPE, imme- 
diately following this hydrophobic domain, exists in the 
SARS-CoV M protein [1]. 


The coronavirus M protein is the most abundant struc- 
tural protein in virions and the key component in viral as- 
sembly and morphogenesis [65,66]. The N/M interactions 
that might be responsible for the encapsidation of the viral 
nucleocapsid into the budding virions were reported in vitro 
for MHV [67] and TGEV [68], and it was identified that the 
carboxyl terminuses of both M and N proteins are involved 
in this interaction. By employing mammalian two-hybrid 
system, He ef al. [69] investigated the N/M protein-protein 
interactions in SARS-CoV. The results identified that the 
SARS-CoV N/M interaction does take place in vivo and a 
stretch of amino acids (residues 168-208) in SARS-CoV N 
protein may be critical for such protein—protein interactions. 
The same region has also been found to be required for 
SARS-CoV N protein multimerization [55], which suggests 
that this region might be crucial in maintaining correct con- 
formation of SARS-CoV N protein for self-interaction and 
interaction with the M protein. While Luo et al. [70] re- 
ported a conflicting result that the C-terminal region (resi- 
dues 351-422) of SARS-CoV N protein and the C-terminus 
(residues 197-221) of M protein might be involved in their 
interactions by using yeast two-hybrid and surface plasmon 
resonance techniques. Sequence analysis revealed that these 
two fragments are highly charged at neutral pH, suggesting 
that their interactions might be of electrostatic attraction. 


Meanwhile, the M protein is also involved in the assem- 
bly of virus-like particle (VLP) together with the small en- 
velope (E) protein [71], which would be vital for assembly 
and budding of the virions. In addition, the M proteins of 
coronaviruses are invariably glycosylated near the N termi- 
nus. Group | and group 3 coronaviruses are N-glycosylated, 
whereas those of group 2 viruses are O-glycosylated [72]. 
The predicted structure of SARS-CoV M protein has an 
NGT sequence profile near its N-terminus, which suggests 
that it might be also N-glycosylated at position 4 [1]. 


Small Envelope (E) Protein 


Small envelope (E) proteins are structurally conserved 
within different coronavirus groups, yet exhibit little se- 
quence similarity among these groups [1,73]. In general, 
coronavirus E proteins are small proteins (varying in size 
from 76 to 109 amino acids), with an unusually long hydro- 
phobic stretch (25-30 residues) located in between hydro- 
philic N and C terminus (~8 and ~ 40 residues respectively). 
Raamsman et al. [74] and Corse and Machamer [75] char- 
acterized E proteins of MHV and infectious bronchitis virus 
(IBV), respectively. Both of those convincing studies re- 
vealed that E protein localizes in intracellular membrane 
with its C-terminal domain extending to cytoplasmic region 
in the infected cell and in the virion toward the interior. The 
hydrophobic N-terminal two-third region of the E protein is 
buried within the membrane. Maeda ef al. [76] represented a 
detailed membrane topology model of MHV E protein, in 
which the E protein spans the membrane twice with its N- 
terminus near the cytoplasmic side, this indicates that no part 
of MHV E protein is exposed on the virion exterior or lumi- 
nally in the infected cells. 
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SARS-CoV E protein has been successively expressed 
and purified from E coli, and its primary 3D model (Fig. 10) 
according to the protein secondary structure prediction was 
built [77], the results showed that SARS-CoV E protein 
shares the same overall structural characteristics although 
only 17%-23% identical with other coronaviridea E proteins. 
The N-terminus consists of the first 11 amino acids, which 
seems to integrate into the membrane directly due to the lack 
of any predicted cleavage site in the SARS-CoV E protein 
sequence. The hydrophobic transmembrane (TM) segment 
ranges from residues 12 to 34 and adopts an o-helix confor- 
mation inserting into the lipid bilayer well. The two short B- 
sheets are composed of residues 45 to 51 (B-sheet I) and 
residues 55 to 61 (B-sheet ID), respectively; B-sheet I seems 
to form hydrogen bonds with the surface of the lipid bilayer. 
Furthermore, the hydrophilic C-terminal tail (residues 62-76) 
is predicted to expose to the cytoplasmic side. The unusual 
length of the hydrophobic segment of SARS-CoV E protein 
has posed a problem with respect to assigning the topology 
to the protein. Arbely et al. [73] presented a detailed struc- 
tural model for SARS-CoV E protein, which determined the 
topology of the protein and the effects upon the lipid bilayer 
thereof. The results showed that SARS-CoV E protein con- 
tains an unusually short palindromic transmembrane helical 
hairpin around a previously unidentified pseudo-center of 
symmetry, a structural feature that seems to be unique to 
SARS-CoV. In addition, the hairpin structure can deform 
lipid bilayer by increasing their curvature, which might 
provide a molecular explanation of the vital role for E pro- 
tein in coronavirus budding. 


Outside 


N-Terminal 


C-Terminal 


Fig. (10). A primary 3D model of SARS-CoV E protein. 


The transmembrane segment was embedded in the POPC lipid 
bilayer, the two short B-sheets were represented by ball-and-stick 
model. Adapted from reference [77]. 


It has been reported that coronavirus E proteins have 
multiple biological functions. MHV E protein can induce 
apoptosis in E protein-expressing cells, and this MHV E 
protein-mediated apoptosis can be suppressed by a high level 
expression of Bcl-2 oncogene, indicating that the apoptosis 
pathway is caspase-dependent [78]. Meanwhile, E protein 
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also plays an important role in some coronaviruses replica- 
tion, such as TGEV [79] and MHV [80]. Another major 
biological function of E protein is about its pivotal associa- 
tion with coronavirus envelope assembly [71]. The forma- 
tion of viral core, envelope and the assembly of virus parti- 
cle would involve the specific interactions between the viral 
structural proteins and the host membrane components, 
among the virus structural proteins and between the viral 
proteins and viral RNAs. The extensive cellular studies on 
TGEV, MHV and IBV have revealed that coronavirus as- 
sembles at the pre-Golgi membranes of the intermediate 
compartment (IC) early in infection and in the rough endo- 
plasmic reticulum (RER) at late period of the infection [81- 
83]. Unlike most of the other envelope RNA viruses, coron- 
aviruses employ a nucleocapsid-independent strategy to 
drive virus particle assembly and budding. Coexpression of 
coronavirus M protein and E protein in intact cells was ini- 
tially shown to be required for the production of virus-like 
particles (VLPs), while expression of M protein alone does 
not produce VLPs [84]. Furthermore, it was demonstrated 
that expression of E protein alone results in the release of E 
protein vesicles [85]. The crucial role of E protein in viral 
envelope assembly was also indicated by some other studies 
on MHV and TGEV [71, 86]. Although the studies of differ- 
ent coronavirus groups showed that possession of the M and 
E proteins is a minimal requirement for the assembly of viral 
particles, the demonstration that E and M proteins are suffi- 
cient for the assembly of SARS-CoV envelope is still im- 
perative for the rational treatment of this deadly virus. A 
communication [87] reported the successful formation and 
isolation of SARS-CoV-like particles through the co- 
expression of E and M proteins in insect cells and these 
virus-like particles (VLPs) can further incorporate S pro- 
teins. Additionally, coronavirus E proteins are candidate 
members of virus-encoded proteins that form ion channels in 
considering their small size and hydrophobic nature. Wilson 
et al. [88] demonstrated that SARS-CoV E protein does form 
ion channels in planar lipid bilayer, and these channels are 
more selective for monovalent cations than monovalent 
anions. Meanwhile a recent study reported that SARS-CoV 
E protein could induce modification of cell membrane per- 
meability [89], thereby favoring virus replication and pro- 
moting the translation of viral mRNAs during the virus 
infection due to the alterations of ion concentration in the 
cytoplasm of virus-infected cells. This result has evidently 
supported a critical role of E protein for the replication of 
coronaviruses. 


Non-Structural Proteins (nsp’s) 


Analogous to other coronaviruses, the 5’-terminal 2/3 of 
the SARS-CoV genome involves the viral replicase gene 
(ORFS la and 1b), which encodes two large polyproteins, 
ppla and pplab. Expression of pplab involves ribosomal 
frameshifting into the —1 frame just upstream of the ORF la 
translation termination codon. Proteolytic processing of 
these polyproteins are usually mediated by virus-encoded 
proteases thus producing a series of non-structural proteins 
(nsp), some of which are responsible for replication of the 
viral genome and transcription of a nested set of subgenomic 
mRNAs (sg mRNA) to express all the ORFs downstream of 
ORF 1b [8]. SARS-CoV is predicted to have two proteases 
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for polyproteins processing, a papain-like cysteine protease 
(termed as PL2””, represented by nsp3) and a 3C-like prote- 
ase (termed as 3CL*° or M?®, represented by nsp5)[5]. Un- 
like other coronaviruses, SARS-CoV does not have an 
ortholog of another papain-like protease (PL1""°). The PL2?” 
and 3CL*° of SARS-CoV are predicted to cleave the poly- 
proteins at 3 and 11 sites respectively thus totally generating 
16 non-structural proteins [5,90,91]. 


The structures and potential functions of several non- 
structural proteins of SARS-CoV have been characterized. 
The solved crystal structure of SARS-CoV nsp9 (Fig. 11) 
showed that nsp9 is a single-stranded RNA-binding protein 
with a novel oligosaccharide/oligonucleotide fold-like struc- 
ture [92]. In detail, the structure of SARS-CoV nsp9 has a 
compact corn-shaped B-barrel comprising seven B—strands 
and one o-helix, flanked by a C-terminal o-helix and N- 
terminal extension. Dynamic light scattering and analytical 
ultracentrifugation experimental results indicated that nsp9 
exists as homodimer in solution at millimolar concentration 
grade [93], which is probably biologically important given 
the hydrophobic nature of the interacting surface and the 
conservation of amino acids involved in dimerization. Due 
to the crucial role of RNA-dependent RNA polymerase in 
the virus cycle and the success of polymerase inhibitors in 
the treatment of various viral infections [94,95], the homol- 
ogy model of SARS-CoV RdRp, represented by nsp12, was 
recently built for charactering the potential polymerization 
mechanism of SARS-CoV and putative inhibitor design (Fig. 
12) [96]. The result showed that the SARS-CoV nsp12 folds 
in a classic “right hand” shape with motifs in which the 
characteristic finger, palm and thumb sub-domains can be 
recognized. The finger domain consists of three helices, the 
RdRp unique helix-loop-helix and two stranded B-sheets. 
The palm domain is composed of two helices and the 
B-hairpin structure, which involves the two catalytic aspar- 
tates responsible for the nucleotide transfer reaction. The 
thumb domain consists essentially of two a-helices and a 
large loop, and the thumb and finger domains are intercon- 
nected with loops. The active center is buried in the center of 
the protein. Since SARS-CoV RdRp plays a pivotal role in 
virus replication and its inhibition will not cause undesirable 
side effects during therapy, it has been regarded as an ex- 
cellent target for anti-SARS drug discovery. Based on the 
three-dimensional model, potential nucleoside analog and 
non-nucleoside inhibitors of SARS-CoV RdRp have been 
proposed by molecular docking [97]. 


Another non-structural protein that is likely to be impor- 
tant for viral replication is the SARS-CoV helicase (also 
called nsp13). Both Tanner et al. [98] and Ivanov et al. [99] 
reported that recombinant SARS-CoV nsp13 has multiple 
enzymatic activities, including both RNA and DNA duplex- 
unwinding activities with a 5’ to 3’ polarity, NTPase, dNT- 
Pase, and an RNA 5’-triphosphatase activity. These reported 
functional and structural properties of the SARS-CoV heli- 
case may provide an opportunity for selective inhibitors 
design and the future development of suitable helicase-targe- 
ted anti-SARS drugs, considering that promising NTPase/ 
helicase inhibitors have been tested for the treatment of 
herpes simplex virus [100,101] and hepatitis C viral (HCV) 
infections [102]. For the remaining SARS-CoV non-struc- 
tural proteins produced from polyproteins ppla and pplab, 
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putative activities have been predicted based on the presence 
of functional domains in their sequences or by their struc- 
tural similarities to other proteins, e.g. the N-terminal do- 
main of SARS-CoV nsp14 has been predicted to be a 3’-to- 
5’ exonuclease (ExoN) that belongs to the DEDD subfamily 
[90], a “nidovirus-specific” replicase domain in the C-ter- 
minal part of nsp15 is in association with a poly (U)-specific 
endoribonuclease (XendoU) [90], and SARS-CoV nsp16 has 
been confirmed as 2’-O-methyltransferase (2’-O-MT), which 
is supported by the presence of the conserved tetrad of resi- 
dues K-D-K-E essential for mRNA cap-1 (mGpppNm) for- 
mation [103]. 


Fig. (11). Ribbon representation of the crystal structure of 
SARS-CoV nsp9. 


The protein has a compact corn-shaped B-barrel comprised of 
seven B-strands and one o-helix flanking by a C-terminal o-helix 
and N-terminal extension, and exists as homodimer in the crystal. 
Adapted from reference [92]. 


Fig. (12). Ribbon diagram of SARS-CoV RNA-dependent RNA 
polymerase (nsp12) [redrawn from ref. 96]. 


The characteristic fingers, palm and thumb domains can be recog- 
nized from the “right-hand’’-like fold structure. 
CONCLUSIONS 


A highly contagious illness called severe acute respira- 
tory syndrome (SARS) ever broke out in China and quickly 
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spread to more than 30 other countries almost two years ago. 
The rapid transmission and the high mortality rate (nearly 
10%) make SARS a global threat. Although the SARS dis- 
ease has been controlled by conventional measures such as 
rapid detection, infection control, isolation, quarantine, 
contact tracing, no efficacious therapy and drugs against 
SARS are available to date. SARS coronavirus (SARS-CoV) 
has been identified as the etiological agent responsible for 
this infectious disease, and SARS-CoV genome is predicted 
to contain 14 functional open reading frames (ORFs). The 
two large 5'-terminal ORFs, la and 1b, constitute the repli- 
case gene encoding the proteins complex (e.g. the genome- 
encoded proteases and a series of non-structural proteins) 
required for viral RNA transcription and replication, and the 
remaining twelve ORFs encode the four key structural pro- 
teins, the spike (S) protein, the nucleocapsid (N) protein, the 
membrane (M) protein and the small envelope (E) protein, 
and several accessory proteins. Characterization of the 
structures and functions of these major proteins involved in 
SARS-CoV has become an imperative project, which can 
afford us an overall picture for understanding the assembly 
and maturation mechanism of SARS-CoV and exploring the 
feasible therapy and drugs for the treatment of this disease. 
Actually, extensive studies concerning these pivotal proteins 
encoded by SARS-CoV genome have been performed. This 
review gives a brief summary of the recently substantial 
scientific progress that has been made towards understand- 
ing the structural and functional characters of major proteins 
involved in SARS-CoV and the corresponding drug devel- 
opment. These studies would help us understand the mo- 
lecular mechanism of SARS-CoV replication and infection, 
and provide valuable information for rational treatment with 
this disease. 
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ABBREVIATIONS 

SARS-CoV = Severe acute respiratory syndrome 
coronavirus 

TGEV = Transmissible gastroenteritis coronavirus 

HCoV = Human coronavirus 

MHV = Mouse hepatitis virus 

IBV = Infectious bronchitis virus 

3CL” = 3C-like protease 
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S protein = Spike glycoprotein 
ACE2 = Angiotensin-converting enzyme 2 
N protein = Nucleocapsid protein 
CypA = Cyclophilin A 
hnRNP Al = Heterogeneous nuclear ribonucleoprotein 
Al 
M protein = Membrane protein 
VLP = Virus-like particle 
E protein = Small envelope protein 
nsp = Non-structural protein 
RdRp = RNA-dependent RNA polymerase 
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